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Abstract 

Two  new  families  of  languages,  the  F(k)  and  U(k) 
languages,  are  Introduced   each  of  which  is,  in  some  sense, 
a  generalization  of  top-down  deterministic  languages.   This 
leads  us  to  new  characterizations  of  s-grammars  and  LL(1) 
languages.   We  include  a  characterization  of  the  unambiguous 
context-free  languages,  generalizations  of  the  equivalence 
relation  on  s-grammars  to  LL(k)  grammars,  and  a  summary  of 
the  non-closure  results  for  LL(k),  F(k)  and  U(k)  languages, 
and  it  is  shown  that  non-degenerate  hierarchies  exist  for 
the  families  of  F(k)  and  U(k)  languages. 


Ill 


Introduction 

A  survey  of  the  approaches  to  top-down  deterministic 
languages  has  been  given  in  Wood  (1969a),  however  since  that 
time  some  earlier  unpublished  work  by  Schorre  (1965)  and 
Tixier  (1967)  has  come  to  light.   The  aim  of  this  paper  is 
to  investigate  their  approach,  relating  it  to  the  LL(k) 
languages  of  Lewis  and  Stearns  (1968).   At  the  same  time  we 
take  this  opportunity  to  generalize  some  results  of  Korenjak 
and  Hopcroft  (I966)  and  to  include  a  survey  of  the  non- 
closure results  for  LL(k)  languages,  some  of  which  are  new. 

In  Section  2  we  introduce  separability,  in  Section  3 
f-separabillty  and  f-quasl-separability ,  in  Section  4  we  deal 
with  generalizations  of  the  equivalence  relation  for  s-grammars 
and  in  Section  5  are  found  the  various  non-closure  counter- 
examples . 
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1 .   Notation 

We  use  0  to  denote  the  empty  set.   A  granunar  G  Is  a  ^-tuple 

G  =  (N,T,S,P) 
where 

N        is  a  finite  set  of  nonterminal  symbols, 

T        is  a  finite  set  of  terminal  symbols 

S  in  N   is  the  sentence  symbol,  and 

P        is  a  finite  set  of  rules  (or  productions)  of  the 

form  X  H-  X,  X  in  N  and  x  in  (N  u  T)  .   Let  V  =  N  u  T, 
If  X  ->■  X  in  P  then  x  is  an  alternative  of  X. 

In  the  usual  manner  the  free  monoid  generated  by  a  set 
of  symbols  A,  is  denoted  by  A  ,  similarly  A"*"  =  AA  .   e  denotes 

the  empty  word.   We  have  the  binary  relations  => ,  =>,  => 

+    «  »  ^   "^   ^ 

or  more  usually   =>,  =>,  => ,  on  words  over  V  ,  which  define 

derivations  over  G.   The  language  generated  by  a  word  w  is 

the  set  {x  :  w  =>  X,  X  in  T  } ,  written  both  as  L(w)  or  w, 

in  this  paper.   The  language  generated  by  the  grammar  G, 

denoted  L(G),  is  L(S).   The  length  of  a  word  x  in  V   is 

denoted  by  |x|,  and  is  the  number  of  symbols  in  x,  |e|  =0. 

* 

For  k  >  0,  for  all  x  in  V   let  kjjc  be  x  if  |x|  <  k, 

otherwise  x^  where  |x,|  =  k  and  x  =  x.x^. 

We  say  a  grammar  is  admissible  if  for  all  X  in  V  there  exist 
derivations  S  =>  uXv  and  X  =>  x,  where  u,v  in  V  ,  x  in  T  . 
Henceforth  grammar  means  admissible  grammar. 
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Definition 

For  k  >  0,  X  in  N  is  said  to  be  LL(k)  if  for  all  u,  v,  v', 

I  T 

U-.  ,  U-,  ,  X,  X  ,   such  that 

S  =>  uXv  =>  uxv   =>  uu., 

S  =>  uXv'=>  ux'v  =>  UU-,  ,   and 

T 

k:u-,  =  k:u-,   then   x  =  x',   where 
UjU-,  ,u-,  in  T  ,   v,v'  in  V    and   X  ->•  x ,   X  ^  x'  in  P  . 
Similarly  a  grammar   G   is   LL(k)   if  for  all   X   in   N 
X   is   LL(k).   L  is  an  LL(k)  language  if  it  is  generated  by 
some  LL(k)  grammar.   Let  oL(k)  be  the  family  of  LL(k)  languages 

and  ^=   LJ  <^(k). 

all  k 

Definition . 

A  grammar  G  is  an  e-free  grammar  if  for  all  X  ->■  x  in  P, 
X  in  V  .   An  e-free  LL(1)  grammar  is   an  s-grammar , 
similarly  an  s-language  is  a  language  that  can  be  generated  by 
an  s-grammar. 

A  set  X  is  £-free  if  e   not  in  X. 

Definition . 

A  nonterminal  X  has  a  cycle  (or  is  cyclic)  if  there  exists 
a  derivation  X  =>  uXv,   u,v  in  V  ,   uv  7^  e .   If  uv  =  e  then 
X  has  a  loop.   A  grammar  G   has  a  cycle  (loop)  if  at  least 

one  nonterminal  in  G  has  a  cycle  (loop).   If  X ,  Y  in  N  have 

+  * 

cycles  and  there  exists  no  derivation  X  =>  uYv,   u,v  in  V  , 

then  X,  Y  are  said  to  have  disjoint  cycles. 
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2.   .  .nd   Context-Free  Grammars 

)  introduces  the  concept  of  separability  which 
is  further  explored  by  Tixier  (1967).   We  extend  the  notion  of 
separability  in  order  to  apply  it  to  arbitrary  context-free 
grammars.   By  "set"  we  will  mean  "a  set  of  words  over  some 
terminal  alphabet  T." 

Definition 

The  left  quotient  (right  quotient)  of  a  set  X  by  a  set  Y 
is  {u  :  vu  in  X,  v  in  Y}  ({u  :  uv  in  X,  v  in  Y})  denoted 
by  X\Y   (X/Y). 

Definition 

The  proper  left  quotient  of  a  set  X  by  a  set  Y  is 
{u  :  vu  in  X,  V  in  Y,  u  7^  e)    denoted  by   X  .\  Y .   Similarly 
we  define  proper  right  quotient  denoted  by  X/.Y 

Definition 

Let  (A,B)  denote  an  ordered  pair  of  sets.   We  say  (X,Y) 
is  separable  if  X  .\  X  n  Y/.Y  =  0.   Let 

I(X,Y)  denote  the  set  X  .\  X  n  Y/.Y  . 

Remarks 

1)  It  follows  that  if  e  is  in  a  set  X  then 

X  -  (e)  C  X.\X  and  X  -  {e}  C  X/.X   . 

2)  If  x  in  T"*"  then   x  .\  x  =  x/.x  =  0. 
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Definition 

A  nonterminal  X  Is  separable  If  for  all  X  ->  x  In  P,  where 

+  * 

X  Is  In  W   for  all  Y,Z  in  V   such  that  x  =  uYZv,   u,v  in  V  , 

(Y,Z)  is  separable.   A  grammar  is  separable  if  all   X  in  N 
are  separable. 

Lemma  1 

Given  a  grammar  G  such  that  for  some  X  ->■  uYZv  in  P 

(Y,Z)  is  separable  then  it  follows  that: 

+  *  ~  ~ 

(i)        for  all  Y  =>  yW,  y  in  V  ,  W  in  V,  (W,Z)  is  separable, 

(ii)       for  all  Z  =>  Uz,  z  in  V  ,  U  in  V,  (Y,U)  is  separable, 
(ill)      for  all  W  and  for  all  U  as  above,  (W,U)  is  separable. 

Proof: 

We  will  prove  (i)  in  detail,  (ii)  and  (ill)  follow  similarly. 
(1)        Assume  the  contrary,  then  I(W,Z)  i-   0   therefore 
there  exists  at  least  one  p  in  I(W,Z).   Thus  there  exists  p.. 
in  W  such  that  p-,p   is  in  W,   therefore  qp-,  and  qp-.p   are  in  Y 
for  some  q  in  y.   Therefore  p  is  in  1(Y,Z)  giving  a  contra- 
diction.  The  result  follows.  11 

Corollary  2. 

If  G  is  a  separable  grammar  then  for  all  X,Y  in  V  such  that 
there  exists  a  derivation  Z  =>  uXYv,   (X,Y)  is  separable. 

The  converse  of  Lemma  1  obviously  does  not  hold,  however 
we  do  have : 
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y  3 . 
Given  a      ar  G,  X  -►  uYZv  Is  in  P  and  for  all  Y  •>  wW 
in  P,  u,v,w  in  V  ,  Z,W  in  V,      <  is  separable  then  (Y,Z) 
is  separable. 

Definition 

We  say  an  n-tuple  (X,  , .  .  .  ,X  )   is  R  separable   for  n  >_  2, 
if  for  all  i,  1  <  i  <  n,   (X.  ,X._^,  , .  .  .  ,X  )  is  separable. 

Definition 

A  nonterminal  X  is  R  separable  if  for  all  X  -»-  X^...  X  in  P, 
n  >_  2,   (X  , .  .  .  ,X  )  is  R  separable.   A  grammar  is  R  separable 
if  all  X  in  N  are  R  separable. 

We  now  give  a  lemma  that  demonstrates  the  behavior  of  R 
separability  under  substitution. 

Lemma  ^ . 

Given  a  grammar  G  such  that  for  some  X  ->  X,  ...  X  in  P, 

n  >_  2,   (X  J  .  .  .  ,X  )  is  R  separable  then  it  follows  that: 

(i)        for  all  i,  1  <_  i  <  n,  for  all  X^  =>  yY,  y  in  V  , 

Y  in  V,  (Y,X^^-|^, .  .  .  ,X^)  is  R  separable. 

(ii)       if  for  some  i ,  1  <  i  <  r ,  X,  ->  Y,  ...  Y   is  in  P, 

—   —   '11      m 

m  >_  2    (Y,,...,Y  )  is  R  separable;  then 

(Y, , . . . ,Y  ,X.,,,...,X  )  is  R  separable, 
i      m'l+i      n 

Proof: 

(i)   follows  directly  from  Lemma  1. 

(ii)  Assume  the  contrary,  then  there  exists  a  J,  1  £  J  £  m, 

such  that  (Y ,  ,Y  ..,,...  ,Y  X..,  ,...,X  )  is  not  separable.  Let 

J   j+l      m  i+l     '  n  ^ 
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V  denote  Y .  , -,  ...  Y   and  x  denote  X..-,  ...  X  .   This  implies 
•^  j  +  1      m  1+1      n  "^ 

there  exists  a  q  in  I(Y.,yx),  a  p  in  Y .  and  an  r  in  yx  such 

J  J 

that  pq  is  in  Y .  and  qr  is  in  yx .   Now  r  =  r-iTp  where  r^ 
in  y ,  rp  in  x . 

(a)  qr  =  qr-,  r^   where  qr.  in  y,  v^    in  x. 

If  I  rp  I  _>  I  rp  I   let   rp  =  Tp-j^rp   then  rp-,  in  l(X.,x) 
as  both  pqr-,  in  Y.y   and  pqr-,  in  Y.y.   If  |  rp  |  <  |  rp  |  a 
similar  argument  holds. 

(b)  qr  =  q-nqpr  where   q.,  in  y,   qpT  in  x.   As  rp  in  x 
we  have  qp^-,  in  x/.x  and  as  p,pq  in  Y.,   Q-i  j^-i  ii^  Y 
we  have  q^r-,  in  Y.y.\Y.y. 

Both  cases  lead  to  I(X.,x)  /  0,    a   contradiction.  || 

Corollary  5. 

If  G  is  an  R  separable  grammar  then  G  is  separable. 
This  is  really  a  corollary  to  the  definition  of  R  separability 

Corollary  6 . 

If  G  is  an  R  separable  grammar  then  for  all  X.  in  V  such 

that  there  exists  a  derivation  Z  =>  X,  ...  X  ,  n  >  2, 

1      n'    —   ' 

(X,,..,,X  )  is  R  separable. 

Definition 

We  say  an  n-tuple  (X    .  .  ,X  ),  n   >_  2 ,    is  L  separable  if  for 
all  i,  1  <  i  £  n,   (X   ...  X.  .,  ,  X.)  is  separable.   We  can 
extend  the  definition  to  nonterminals  and  grammars. 
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We  now  give  two  examples  which  Illustrate  separability. 

Example  1. 

Let  X  -*■   X^XpX    be  a  rule  of  some  grammar,  where 
X^  =  {a}*  =  X^  and  X2  =  (a). 
Then   (X,,Xp)  and  (Xp,X-,)  are  separable  but 

(X,,XpX^)  and  (X,Xp,X^)  are  not  separable. 
Thus  iX^yX^yX^)    Is  not  L  or  R  separable  (note  that  X  is 
ambiguous ) . 
Example  2. 

Let  X  be  as  above,  where  X,  =  {a,e},   Xp  =  {b,e}   and 
X^  =  {a}.   Then  X  is  both  L  and  R  separable  and  therefore 
separable . 

This  leads  to  the  following  lemma. 

Lemma  7. 

A  grammar  is  L  separable  if  and  only  if  it  is  R  separable. 

Proof:   This  is  not  included  as  it  is  very  similar  to  that 
used  in  Lemma  ^ .  | | 

Definition 

A  grammar  is  s (uper )-separable  if  it  is  R  and  hence  L 
separable . 

Corollary  8. 

If  G  is  an  s-separable  grammar  then  for  all  X.  in  V  such  that 

there  exists  a  derivation  Z  =>  X^  ...X  ,   n  >  2,  (X,  , . .  .  ,X^ ) 

i  n'  — '        1'         'J 

Is   s-separable   for  all   i,J,      l<i<n,i<J. 
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Remark. 

s-separability  formalizes  an  Intuitive  notion  of 
unamblguity,  i.e.  if  (X,Y)  is  separable  then  we  know  that 
there  is  no  subword  v,  such  that  there  exists  x  in  X,  y  in  Y 
such  that  XV  in  X  and  vy  in  Y.    Therefore  it  follows  that 
each  word  x  in  XY  can  be  uniquely  partitioned  into 
X-,Xp  with  X,  in  X,  Xp  in  Y.   Thus  s-separability  is  a  necessary 
condition  for  unambiguity,  that  it  is  not  sufficient  is  shown 
by  the  next  example. 

Example  3 • 

Let  G  =  ({S,X^,X2,Y^,Y2},{a,b},S,{S->X^Y^|X2Y2, 

X^^a,  X^^a,  Y^^b,  Y2^b}). 
G  is  trivially  s-separable  and  ambiguous. 

We  are  now  in  a  position  to  prove  the  following  necessary 
and  sufficient  condition  for  unambiguity. 

Theorem  9. 

A  grammar  G  is  unambiguous  if  and  only  if  the  following 
property  is  satisfied: 
(i)         for  all  X  in  N,  for  all  X  ^  x^,  X  ->  Xp  in  P 

x-j^  n  x^  =  0  ,   ^1  ^  ■^2* 
(ii)       G  is  s-separable.   This  is  Property  A. 

Proof:   As  sufficiency  follows  trivially  consider  necessity. 
Assume  the  property  holds  and  G  is  ambiguous.   There  exists 
at  least  one  word  x  in  L(G)  such  that  x  has  at  least  2  distinct 
derivation  sequences;  let  these  be  {v.}  and  {w.},   where 
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v.,  =  -A-,  =  .         =  w   =  X.   There  exists  J,  0  1  J  <  "^i"  (p»q) 
such  that 

v^  =  w^,  0  <  1  <  J   and  v^^^  ^   Wj^^  . 

Let  V.  =  aXv,   v.^,  =  ax,v,  w.^,  =  ax2V,   a  in  T  ,   X  in  N, 

V  in  V  ,  where  X  ->  x,,   X  -^  x-  in  P,   x^  ?^  x^. 

»  «         «  * 

Now  either  x,  =>  a,  ,   x^  =>  a^*   v  =>  ^i^^      a^,a2  in  T  ,  where 

X  =  a  a^ap.   Thus  x,  n  X2  ?^  0 

or         x^  =>  a^,   X2  =>  a^,   ^1,^1  ^"^  "^  >  l^i'  <  '^il  ^^y» 
and        V  =>  ap ,   v  =>  ap,    a2,ap  in  T  with  a,a2  =  a^a2. 
Thus  I(X,v)  ^   0,  which  by  Corollary  8  contradicts  the  second 
condition.   The  theorem  follows.  || 

Definition 

G  is  a  binary  grammar  if  for  all  X  -»■  x  in  P,  x  =  e  or 
X  =  X^X2,   X  in  N,    X-j^,X2  in  V. 

Schorre  (1965)  proved  the  following  result. 

Corollary  10. 

A  binary  grammar  is  unambiguous  if  and  only  if  the  following 
property  is  satisfied: 
(i)        for  all  X  in  N,  for  all  X  -►  x^,   X  ^  X2  in  P, 

x^  /  X2 ,   x^  n  X2  =  0. 
(11)       G  is  separable.   This  is  Property  B. 

Proof:    (X,Y)  is  s-separable  if  and  only  if  (X,Y)  is  separable, 
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Remark. 

The  proof  of  Theorem  9  can  be  followed  through  by  replacing 
s-separabillty  with  R  separability,  and  thus  Lemma  7  is  not 
necessary  for  this  proof.   Thus  a  direct  proof  of  Corollary  10 
would  be  much  shorter  than  that  originally  given  by  Schorre 
(1965).   Property  B  can  be  applied  whenever  at  most  two 
nonterminals  appear  together  in  the  right  sides  of  the  rules 
in  a  grammar,  so  we  obtain  the  following  result. 

Definition 

A  grammar  is  in  e-(k,2  )-normal  form,  k  >_  1   if  for  all 
X  ->  X  in  P,  either  x  in  T  ,  |  x  |  <  k   or   x  =  ay,  a  in  T  ,|al=k, 
y  in  N  vj  NN. 

It  is  known  (Wood,  1970),  that  every  grammar  can  be  put  in 
e-(k-2)  normal  form. 

Corollary  11. 

An  e-(k,2)  normal  form  grammar  is  unambiguous  if  and  only  if 
Property  B.  holds. 

Definition 

A  grammar  is  an  operator  grammar  if  for  all  X  ->-  x  in  P, 
X  in  T  (NT)   (Greibach,  I965). 

Corollary  12 . 

An  operator  grammar  is  unambiguous  if  for  all  X  in  N,  for 
all  X  ^  X-,  ,   X  -J-  Xp  in  P,  ^-\    ^   ^p'   ^1  '^  ^2  ~  '^ ' 
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3.   f-separablllty  and  k :separablllty 

Definition 

(X,Y)  is  f-separable  if 

f(X.\X)  r^  f(Y/.Y)  =  0  , 

where  f  is  a  mapping  from  T  into  some  set  D. 

Letting  f(u)  =  u  for  all  u  in  T  then  we  have  separability. 
We  extend  the  definition  to  give  f-s-separabillty  in  the  obvious 
way.   We  are  particularly  interested  in  the  following  special 
mapping. 

Definition 

(X,Y)  is  k: separable  if  k:(X.\X)  n  k:(Y/.Y)  =  0.   We  define 
k  :s-separable  in  a  similar  way;  let  Ij^(X,Y)  denote  the  set 
k:(X.\X)  n    k:(Y/.Y). 

Definition 

Given  a  nonnegatlve  integer  k,  a  grammar  G  is  a  U(k)  grammar 
if  the  following  property  is  satisfied: 
(i)        for  all  X  in  N,  for  all  X  •>  x^,   X  -»-  x^  in  P,  x^  /  x^y 

k:x,  r\  k:Xp  =  0. 
(ii)       G  is  k:s-separable  . 

Definition 

(X,Y)  is  f-q(uasl) -separable  if 

f(X.\X)  O  f(Y)  =  0  where  f  Is  a  mapping 
from  T"*"  into  some  set  D.   Similarly  define  k:q-separable 
and  k  :q-s-separable . 


-12- 


Definition 

Given  a  nonnegative  integer  k,  a  grammar  G  is  an  F(k)  grammar 
if  the  following  property  holds: 
(i)        for  all  X  in  N,  for  all  X  ->•  x  ,   X  ^  Xp  in  P,  x-,  ^   Xp 

k:  X-,  n  k  :  x„  =  0. 
(ii)       G  is   k :q-s-separable . 

F(k)  grammars  are  a  generalization  of  the  RCF  (regular  context 
free)  grammars  of  Tixier  (1967)  and  the  FCR  (first  character 
recognition)  grammars  of  Schorre  (1965). 

Definition 

A  language  is  a  U(k)  or  F(k)  language  if  it  is  generated  by 
a  U(k)  or  F(k)  grammar.   Let  ''/{-(k )  and  ^(k)  denote  the  families 
of  U(k)  and  F(k)  languages.   Further  let  %i   =    {L:  there  exists 
a  k  >  0,  such  that  L  is  in  'Z<^(k)}   and  similarly  define  ;r. 

Let  us  look  at  some  basic  properties  of  these  families. 
From  the  definitions  and  Theorem  9  we  have: 

Corollary  13. 

For  all  L ,    L   In    2i ,      L  is  unambiguous. 
However,  because  of  the  definition  of  q-separability  we  have: 

Letting  X  =  {c,£}  and  Y  ={  ca''"b''"c"^  ,  a'''b"'c'^  :  i,j  >_  1}  we 
have  {ca^b^c"^',  ca^b-^'c^":  i,j  >  1}  C  XY  and   (X,Y)  is 
2:  q-separable.   Further  as  both  X  and  Y  can  be  generated  by 
1:  q-s-separable  rules  we  obtain  the  following  result. 
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Theorem  1^ . 


^contains  (Inherently)  ambiguous  languages.   We  now  exhibit 
a  language  which  is  not  in  ~j- . 


Theorem  15 


L,  =  {a^,a^b^  :  i,J  >  1}   is  not  in  ^,  and  therefore  '^   is 
a  proper  subset  of  the  family  of  context-free  languages. 

Proof: 

If  G  is  such  that  L(G)  =  L,   then  G  must  have  two  disjoint 
cycles  and  therefore  at  least  three  nonterminals.   Otherwise 
L(G)  would  contain  words  of  the  form  a''"b'^  ,  i  >  j,  thus  we 
need  at  least  one  nonterminal  for  each  cycle  and  one  nonterminal 
that  branches  to  either  cycle,   let  this  be  X.   Then  X  has 
at  least  two  alternatives,  one  of  which  leads  to  words  of  the 
form  a  and  the  other  to  words  of  the  form  a  b  ;  let  these  be 
X  ->•  X,  and  X  -^  X2.   Then  for  any  k  >  0,  k:x-j^  n  kix^  =^0.   |  | 

Corollary  I6 . 

L,  is  not  in  '1)i  . 

It  turns  out  that  c^    and  ^    are  incomparable,  an 
unexpected  result. 

Theorem  17. 

L2  =  (a^b  C^  :  i  >  J  >  1}   is  in  ^  but  not  in  ^   . 

Proof:   The  rules   S-^AB,   A-»-aA|e,   B-^  aBc|b   are 
l:s-separable ;  therefore  Lp  is  in  ^(1). 

There  are  three  distinct  ways  In  which  L^  can  be  generated, 
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these  are: 

(a)  form  a'^a'^b  c"^ 

(b)  form  a'^a'^b  c"^ 

(c)  form  a-^'  (bje)"^'  . 

Each  of  these  can  be  shown  to  be  non-generable  by  an  F(k) 
grammar  for  any  k  >  0;  in  a  similar  manner  to  the  proof  of 
Theorem  15.   We  will  prove  (a)  only.   There  must  be  two 
disjoint  cycles,  one  to  generate  a""" ,  one  to  generate  a'^b  c"^ 
and  there  must  be  a  nonterminal  that  branches  to  both  cycles; 
so  that  a''"a'-^b  c"'  can  be  formed.   Let  this  be  the  rule  X  ^  AB, 
where  A  =  {a"^},   B  =  {a"''b  c"^}   then  A  .\  A  =  {a"""}  and  therefore 

k:  (A.\A)n  k:  B  ?^  0  for  any  k  >  0.  |  | 

We  now  show  every  LL(k)  grammar  is  U(k). 

Theorem  l8. 

For  all  k  >  0,  each  LL(k)  grammar  is  U(k). 

Proof: 

Each  LL(k)  grammar  trivially  satisfies  the  first  condition 
for  a  grammar  to  be  U(k),  therefore  it  remains  to  show  that 
each  LL(k)  grammar  is  k : s-separable .   Proceed  by  contradiction 
Given  G,  an  LL(k)  grammar  assume    it  is  not  a  U(k)  grammar. 
Then  there  exists  at  least  one  rule  Y  ->-xXy  such  that  (X,y) 
is  not  k:separable.   This  implies:   b  in  I,  (X,y). 
If  be  in  I(X,y),  c  in  T    then  G  is  ambiguous,  therefore  we 
must  have 
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X  A  X   a  ^ -y ,      c ,      r  , 

|b|  =  ;  ^  e. 

Thus 

abc  in  X  for  some  a  in  X  and 

bde  in  y  for  some  e  in  y. 
We  have 

Y  =>  xXy  =>  u^Xy  =>  u^ay,   u^  in  T  ,  and  (1) 

Y  =>  xXy  =>  u^Xy  =>  u^abcy,  (2) 

which  in  turn  implies  there  exists  Z  in  N  such  that 

«  » 

X  =>  a^Z  =>  a^z^  =>  a-|^a2  (3) 

« 

X  =>  a,Z  =>  a,Zp  =>  a-japb  c  ,  (^) 

where 

a^aj  =  a,   a^,a2  in  T  ,   ^i  '^  ^2'   ^  "^  ^1'   Z  ->  z^  in  P. 

» 
Thus  letting  S  =>  uYv,  combining  (1)  with  (3)  and  (2)  with 

(^4)  and  noticing  that  in  (1)  y  =>  bde,  we  have  that  Z 

is  not  LL(k) .  |  | 

This  leads  to  an  indirect  proof  of: 

Corollary  19 . 

Each  LL(k)  grammar  is  unambiguous. 
By  an  almost  identical  proof,  we  also  have: 
Corollary  20. 

For  all  k  >  0,  each  LL(k)  grammar  is  F(k). 

We  now  compare  ^(1),   '^^(1),   >(1). 
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Theorem  21. 

(i)        ^(1)  -  ^(1) 

(li)       ^il)    ^    tiil)    . 

Proof: 

(i)        Because  of  Corollary  20  we  need  only  show  that 
l:q-s-separability  implies  LL(1).   Assume  it  does  not. 
Then  given  G,  an  F(l)  grammar   there  exists  at  least  one 
X  in  N  which  is  not  LL(1).   Thus 

S  =>  uXv  =>  ux,v  =>  uu,  , 


S  =>  uXv'  =>  ux„v'  =>  u  U-,  ,    u,  u.  ,  u-j^  in  T  , 

15   X  ^  x^ 


* 

v,v'  in  V  ,   X  ->  X-,  ,   X  ^  x„  in  P, 


1:  U-,  =  1:  u.   but  x-.    /   x^    . 

(a)  e   in  X. 

Then  u-,=  e    and  Up=  e  implies  e  in  x,  r\  Xp  and  X-{e}C  X.\X 
therefore   l:u-,  in  1:(X.\X)  and  l:u-,  in  l:v 

(b)  e   not  in  X. 

1 :  X-,   or   1 :  Xp  7^  0. 

In  both  cases  we  have  a   contradiction. 

(ii)       L^  =  {a^Cb  bbd)^:  i  >  1}   is  known  to  be  LLC2)  but 
not  LLCD   (Rosenkrantz  and   Stearns,  1970).   It  can  be 
generated  by  a  UCl)  grammar,  however.   Let 

G  =  ({S,A,B,C},{a,b,d},S,P) 
where 
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P  =  {S  -►  aA, 

A  -  aAB|B, 

B  -*■   bC, 

C  ->■   bd|e}. 
As  B/.B  =  0  the  grammar  is  trivially  U(l),  but  not  F(l).  || 
We  have  the  weaker  characterization: 

Corollary  22. 

A  grammar  is  an  s-grammar  iff  it  is  an  e-free  U(l)  grammar. 

Because  L,  in  Theorem  15  is  deterministic  but  not  in  U 
we  are  lead  to  the  following  result. 

Theorem  23. 

U  and  the  family  of  deterministic  languages  are  incomparable 

Proof: 

R  * 

Let  L^  =  {ww  :  w  in  {a,b}  },   then  L_  is  not  deterministic 
but  it  is  generated  by  the  rules 

S  -*■   aSa|bSb|e  , 
which  are  trivially  U(l)  and  F(2).  || 

Corollary  2^. 

We  now  show  that  a  nontrivial  hierarchy  exists  for  both  the 
^(k)  and  ^(k)  families. 

Theorem  25. 

For  any  k  >  1,   L^(k)  =  (a^  (b  |  b^"*'-'-d  )^ :  i  >  1}  is  not  F(k). 
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Proof: 

Any  grammar  generating  L^ik)   must  have  a  cycle  for  some 
nonterminal   X,  say.   I.e. 

X  =>  uXv 
where   u  =>  a  ,   v  =>  (b|b    d)  ,  m  >  1.   This  Implies: 
b^d  In  X.\X   and  b^  In  k:v.   Thus 

k:  (X.\X)nk:v7^0. 
However   It  Is  F(k+1). 

Corollary  26. 

^(1)C  ^(i+1)  for  all  i  >  1. 

Theorem  27. 


For  any  k  >  1,   L  (k)  =  {a^  (d  |  b^d  |  db^"*"^  )^ :  i  >  1} 


k. I ,^k+lxl 

Is  not   U(k)  . 

Proof: 

As  in  Theorem  25  we  find  an  X  in  N  such  that 
X  =>  uXv,   b     in  X.\X   and   b   in  v/.v  ; 
thus   k:(X.\X)n  k:(v/.v)  /  0;   therefore 
L^(k)  is  not   U(k),   but  it  is   U(k+1).  || 

Operations  on  UCk)  and  F(k)  languages. 

By  noting  that  L-j^  =  {a^,a^b^:  i  >  1}   Is  neither  U(k) 
nor  F(k)  for  any  k  >  0,  but  that  it  is  the  union  of  two 
languages  which  are  both  U(l)  and  F(l),  we  can  construct 
the  various  non-closure  results  given  in  Table  1 ,  in  a  similar 
manner  to  those  given  for  LL(k)  languages  in  Section  5. 
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1 .   Relations  and  Context-Free  Grammars 

Korenjak  and  Hopcroft  (I966)  (henceforth  let  KH  denote 
this  reference)  solved  the  equivalence  problem  for  s-grammars 
by  considering  a  relation  on  V   (in  fact,  an  equivalence 
relation).  Attempts  since  then  have  been  made,  without  success, 
to  extend  this  notion  to  solve  the  equivalence  problem  for 
LL(k)  grammars,  although  this  problem  has  been  solved  by  a 
different  route  in  Rosenkrantz  and  Stearns  (1970).   Tixier 
extended  the  relation  to  LL(1)  grammars;  we  now  extend  it 
to  unambiguous  grammars  and  languages. 

Definition. 

A  nonterminal  X  in  N  has  the  prefix  property  if 
X  =>  X  =  yz  {z   ^   e)      then  X  />    y. 

The  following  result  is  given  in  KH. 

Lemma  28. 

Jiven  G,  an  s-grammar,  every  X  in  N  has  the  prefix  property. 

If  s  has  the  prefix  property  then  L(G)  is  said  to  have 
the  prefix  property.   Therefore  every  s-language  has  the 
prefix  property. 

Definition. 

If  X  is  a  set  of  words  let  sh(X)  denote  the  length  of  a 
shortest  word  in  X. 

Definition. 

A  grammar  G  is  in  (1,2) -normal  form  if  for  all  X  -*■  x  in  P 
either  x  in  T,  x  in  TN  or  x  In  TNN. 
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Definition. 

* 

If  X,  Y  are  sets  of  words  from  T   and  fall  all  x,  x  In  X 

If  and  only  if  x  in  Y  then  write  X  e  Y  (1 .e .  X  e  Y  if  and 

only  if  X  =  Y).  We  extend  this  to  the  catenation  product 

of  sets  of  words  and  to  words  over  V  (i.e.  x  =  y  if  and 

only  if  X  E  y).  We  say  X  =  Y  is  an  equivalence  pair. 

As  in  KH  we  have : 

Lemma  29 . 

The  relation  '='  is  a  congruence  relation  under  catena- 
tion product . 

Proof: 

(i)  reflexive:  X    =   X  as   X  =  X 

(ii)  symmetric:  X  E  Y  implies   Y  =  X 

(ill)  transitive:  X  =  Y  and   Y  e  Z   implies  X   =    Z 

(iv)  catenation:  X   =   Y  and  W  =  Z   implies   XW  =  YZ . 

Considering  the  corresponding  sets  of  words  (i)-(iv) 
follow  trivially.  | 

Remark 

Note  that  the  above  lemma  holds  for  any  sets  of  words, 
however  they  are  generated. 

We  now  assume  that  for  any  set  X,  X  is  nonempty. 

Definition 

We  say  n(X^,...,X  )  iff  (X^,...,X^)  is  s-separable. 

Lemma  30 . 

If  n(X,Z),   n(W,Y),   W  =  X  and  WY  =  XZ   then   Y  =  Z. 
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Proof: 

,, ._  :  :        re  exists  a  shortest  word  a 

that  cont •       Y   =   Z; without  loss  of  generality  assume 
a  In  Y. 

Let  b  be  a  shortest  word  in  W  (and  hence  in  X);  then 

ba  in  WY  and  ba  in  XZ. 
Now  no  proper  prefix  of  b  can  be  in  X,  by  construction, 
therefore  ba^  in  X,  a^   in  Z,   a-,^a2  =  a,  a^   /   c .      Further 
ap  in  Y  as  a  is  a  shortest  word  that  contradicts  Y  h  Z. 
Therefore  b,ba,  in  W,   a^,a2,  a2  in  Y  contradicts  n(W,Y).|| 

Corollary  31.   Left  cancellation. 

If  n(W,Z),   n(W,Y)  and  WY  =  WZ   then  Y  =  Z. 

Corollary  32. 

If  n(X,Z),   n(W,Y),   Y  E  Z  and  WY  5  XZ   then    W  =  X. 

Corollary  33.   Right  cancellation. 

If  n(W,Y),   n(X,Y)   and  WY  =   XY   then  W  h  X. 

A  useful  operation  is  substitution  of  equivalences, 
which  preserves  equivalences. 

Lemma  3^ . 

If  n(X,Y),   n(W,Y)   and   XY  =  Z   then  X  E  W   iff  WY  = 

Proof: 

i£:     XY  H  Z  and  WY  =  Z  then  XY  =   WY  and  X  =  W. 
only  if:    X  =  W  and  Y  =  Y  then  XY  =  WY  giving  WY  E  Z .     | | 

KH  give   two  transformations  on  equivalence  pairs;  we 
extend  these  in  a  natural  way. 
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Definition .   The  A-trans formation . 

Given  an  e-free  set  X  we  let  X(a)  denote  the  subset  of 

X  defined  as   (u-,  :  u  in  X ,  u  =  au-,  } . 

Given  the  equivalence  pair  X,  .  .  .X   e  Y-,...Y    form 
^  ''^      1     n     1    m 

X-,  (a)  and  Y-,  (a)  for  all  a  in  T.   We  replace  the  equivalence 
pair  by  a  set  of  new  equivalence  pairs 

X^(a)X2...X^  E  Y^(a)Y2...Y^   for  all   a   in  T. 

We  have  trivially: 

Lemma  35 . 

In  the  above  definition 

X, ...X   E  Y, ...Y   iff  for  all  a  in  T 
1     n     1    m 

X^(a)  X^"  '    ^n  -  ^1^^^  ^2-  ••  ^m- 

Remarks . 

(i)        Note  that  X(a)  =  X\{a}. 

(ii)        If  the  equivalence  pair  is  over  N  x  n   then 

the  A-transformation  is  carried  out  as  a  left 
substitution,  followed  by  a  collecting  of  terms, 
as  more  than  one  alternative  of  X,  may  begin 
with  a  specific  terminal  symbol. 

(ill)       Note  that  X, (a)  or  Y. (a)  for  some  a  in  T  may 
contain  the  empty  word.   For  s-languages  (and 
grammars)  we  have  e  in  X  (a)  iff  e  in  Y-,(a)  for 
any  a  in  T.   However  this  is  obviously  not  true 
for  LL(k)  languages  (and  grammars)  in  general. 
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We  now  generalize  the  B-transformation  of  KH. 

Definition.   The  B-trans format Ion. 

Let  X^...X^  E  Y^...Yj^,  a  In  X^  and  aZ  C  Y^...Y^y   £  >  1 , 
for  some  set  Z  such  that  there  exists  no  set  Z^,   Z  C  Z^ 
and  aZ^  C  Y^...Y^,   then  replacing 

X2---Xn  =  ZY,^,...Y^  and   X^Z  E  Y^...Y, 
we  have  the  B-transformation. 

Theorem  36 . 

^l-'-^n  ^  ^l-'-^m  ^'^ 

^2'--\   -  2Y^+i...Y^   and   X^Z  =    Y^...Y^. 

Proof:   As  in  KH. 

Further,  s-separabllity  Is  preserved.  I | 

Considering  equivalences  on  s-grammars  we  have  the 
following  corollaries. 

Noting  that  Z  will  have  the  form  Z^...Z    Z^  in  N,  p  >  0, 
we  have : 

Corollary  37. 

0    <    IpI    <    |a|    +    1,    if  Y^...Y      fit   ay,      for     q   <    a. 

Definition. 

Given  a  grammar  G.,  let 
t.  =  max  ({sh(X)   :  X  in  N^})  and  let  t  =  max({t^}),  all  1. 

This  leads  to  the  following  corollary. 
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Corollary  38. 

If  X-,  .  .  .X   E  Y-,  .  .  .Y   is  an  equivalence  pair  on  s-grammars 
1    n    1    m 

G-,  and  Gp,  then 

(i)        shU^...X^)  =  sh(Y^...Y^) 

(li)       1  <_  m  <_  nt ,   i.e.  the  length  of  the  right  side 
is  bounded  by  the  length  of  the  left  side. 

Then  we  have  the  special  case  of  KH. 

Corollary  39. 

If  n  <_  t  +  3  then  the  left  sides  generated  by  the  B-trans- 
formation  have  length  at  most  t+2,  and  therefore  the  right 
sides  have  length  at  most  t(t+2). 

Definition . 

Given  two  sets  X,Y  then  we  say: 
(i)        X  •<  Y  ,  X  is  left  string  contained  in  Y, 

if  for  all  X  in  X,  xy  in  Y  for  some  y  in  T  , 
(ii)       X  •>  Y,   X  left  string  contains  Y, 

if  for  all  X  in  X,  there  exists  y  in  Y  and  z  in  T 

such  that  X  =  yz, 
(iii)       X  <•  Y,  X  is  right  string  contained  in  Y, 

if  for  all  X  in  X,  yx  in  Y  for  some  y  in  T  , 
(iv)       X  >•  Y,  X  right  string  contains  Y, 

if  for  all  X  in  X,  there  exists  y  in  Y  and  z  in  T  , 

such  that  X  =  zy . 

Similarly  we  can  define  x  •<,  •>,  <*,  >•  y  for  words  x,y  in  T 
We  now  have  the  following  lemma. 
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■a  ^0. 

Given  X^.  -i^2»                                -^^y » 
:e : 

(i)  if  sh(X^)  =  sh(Y^)   then  X^  e  x 

(II)  if  sh(X^)  <  sh(Y^)   then  X^  •<  .^ 

(III)  if  sh(X^)  >  sh(Yj^),  then  X^  •>  Y^  . 

Proof: 

Let  x,,x^,  yi'^?   ^^  shortest  words  in  X,,Xp,Y,,Yp, 
and  X,  ,Xp,Y,  ,Yp  be  the  corresponding  sets  of  shortest  words. 
(i)        Choose  the  shortest  word  which  contradicts 
X,  =   Y,,  let  this  be  x-^.      It  follows  that 

X^  =  Y^   and   X2  =  Y2. 
Now  x,X2  in  Y^Yp.   Thus  we  have  an  immediate  contradiction, 
therefore  X,  =   Y,. 

(ii)       We  have  X^  '<   ^i   and  X2  >•  Y2 . 

Let  X,  be  the  shortest  word  in  X,  such  that  there  exists  no 
y  in  T   such  that  x,y  in  Y, .   Immediately  we  have 

x-,X2  in  Y,Y2  implies  x,-,  in  Y,,  ^ip^?  ^^  ^2 
where  x,,x-,2  =  ^-i  j  giving 

^11^2  ^^   ^1^2  ^^^   ^^   ^2  ^'  "^2   "^  have 

v.  In  X,,   ^272  in  Xp,  where  v-,V2  =  x-it>  givinp; 

a  contradiction  of  the  prefix  condition. 

(Hi)      is  proved  in  a  similar  way  to  (ii).  || 

We  have  in  fact  a  stronger  result. 
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Theorem  ^1. 

Given  sets  X,  ,X„,Y, ,Yp  with  the  prefix  property, 
sh(X-j_)  =  sh(Y-^)  and  shCX^)  =  sh(Y2)   then 

X^Xp  =  Y^Y2   iff  X^  s  Y^   and  X^    e  Y2. 
The  following  question  arises. 

Does  Theorem  4l  hold  without  the  prefix  property? 

Example  1 . 

Let  X-j^  =  {ab^  :  i  >  O},   X^  =  id} 

Y^  =  {a} ,   Y^  =  {b^d  :  i  >  0} 

X-,  does  not  have  the  prefix  property. 

sh(X-j_)  =  sh(Y-^),  shfXp)  =  sh(Y2)   and   X^X^  =  Y^Y2 

but     X^  ^  X2   and  Y^  ^  Y2. 

The  most  we  can  say  is: 

Lemma  42. 

Given  sets  X^,X2,Y-^  Y2   with  sh(X^)  =  sh(Y-^) 
and  sh(X2)  =  sh(Y2)  then  X-^X2  =  Y-|_Y2  only  if  X.^  e  Y-j_ 
and  Y^  E  Yp. 

This  is  just  a  trivial  restatement  of  the  catenation 
property . 
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..E    I 


Comparison  of  LL(k),    F(k),    U(k)    ana   ut- i.trr.' ilniatic   Languages 


Closed  Under 

LL(k) 

DPDL 

U(k)  or  F(k) 

Union 

no 

no 

no 

Concatenation 

no 

no 

no 

Concatenation  with  R 

no 

yes 

no 

Closure 

no 

no 

no 

Reversal 

no 

no 

no 

Intersection 

no 

no 

no 

Complement 

no 

yes 

7 

Intersection  with  regular  set 

no 

yes 

no 

Substitution 

no 

no 

no 

e-free  substitution 

no 

no 

no 

Gsm   mappings 

no 

yes 

no 

e-free  gsm  mappings 

no 

yes 

no 

Inverse  deterministic  gsm 

mappings 
Quotient  with  regular  set 

no 
no 

yes 
yes 

no 
no 

Homomorphism 

no 

n;"' 

no 

e-free  homomorphism 

no 

I;. 

no 
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5 .   Operations  and  LL(k)  Languages 

In  Table  I  we  compare  LL(k)  languages  with  the  deterministic 
context-free  languages  of  Glnsburg  and  Greibach  (1966).   The 
results  for  LL(k)  languages  are  non-closure  results;  this  with 
the  known  result  (Rosenkrantz  and  Stearns,  1970)       that 
the  LL(k)  languages  form  the  largest  known  class  for  which 
the  equivalence  problem  is  decidable,  their  m.isbehavior  is 
surprising.   Most  of  the  results  detailed  below  appeared 
previously  in  Korenjak  and  Hopcroft  (1966),  Rosenkrantz 
and  Stearns  (1970),  Tixier  (1967)  and  Wood  (1969b). 

Boolean  Operations. 

Lemma  ^3. 

(^  is  not  closed  under  (i)  union,   (ii)  intersection, 
(iii)  complement. 

Proof: 

(i)        Let  L^  =  {a^:  i  >  l},  L^   =    (a^b^:  i  i  1} 

then  L  U  Lp  is  not  LL(k)  for  any  k  >  0. 
(ii)       Let  L^  =  {a^(b|c)  a^(b|c)  :  i  >  l}, 

L^   =  (a'^b  a'^b,  a"'"c  a*^  c  :  i,j  >  1}  , 

then  L^   =   L^  n   L^    =    ia^b    a"^b ,  a'^c  a''"c:  i,j  >_  1} 

which  is  not  LL(k)  for  any  k. 
(iii)      Let  Lg  =  {a^b^ :  j  >  i  1  D 

then  {a,b}   -  Lg  is  not  LL(k)  for  any  k, 

this  is  proved  in  Rosenkrantz  and  Stearns  (1970).  || 
Because  L  and  L^  are  regular  sets  we  have  the  following. 
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Corollary  ^^. 

^  is  not  closed  under  union  or  Intersection  with  a 
regular  set. 

Letting  L„  =  (a^b  a-^c,  a^c  a"^b  :  l,j  >  1}   then 

L,3-Lg  =  Lc  and  as  {a,b}   -  Lg  is  not  LL(k)  we  have 

Corollary  ^5. 

is  not  closed  under  subtraction  or  subtraction  with 
a  regular  set. 

Mappings . 

Let  Lg  =  cL,  Kj   dLp,   then  Lg  is  LL(1). 
Define  a  homomorphism  a,  that  maps  d  onto  c  and  the  other 
symbols  onto  themselves,  then 

a(Lg)  =  Lg  =  {ca^jca^b^:  i  >  1)  which  is  not  LL(k) 
for  any  k.   We  have  shown 

Lemma  ^  6 . 

^  is  not  closed  under  e-free  homomorphism. 
Further  as  homomorphism  is  a  special  case  of  substitution 
we  have 

Corollary  ^7. 

^   is  not  closed  under  e-free(finite)  substitution, 
homomorphism  or  (finite)  substitution. 

Similarly  we  can  define  a  gspi   mapping  that  performs  the 
homomorphism  o,  therefore  we  also  have 
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Corollary  ^8. 

^  is  not  closed  under  e-free  gsm  mappings. 

Even  when  these  non-closure  results  hold  it  is  usual  for 
Inverse  mappings  to  give  closure,  however  we  have: 

Lemma  ^9 . 

^  is  not  closed  under  inverse  e-free  homomorphism  or 
inverse  e-free  gsm   mappings. 

Proof: 

Define  the  homomorphism  a-^      (gsm   mapping)  that  maps 
a  and  b  onto  themselves  and  c  onto  b. 

Then   a-,(Lr-)  =  (a'^b  a"^b  :  i  >  1}  which  is  LL(k)  but 
Id  — 

L^  is  not  LL(k) .  I  | 

Finally  we  note  that  as  <^  is  not  closed  under  e-free 
homomorphism,  it  is  not  closed  under  k-limited  erasing. 

Products  and  Quotients. 

Let   L,Q  =  {c}  U  cLp   which  is   LL(1);  then 

L-j_qL   =  {ca^,  ca^b^a^'  :  i,j  >  1}   is  not  LL(k). 

Trivially  letting  L-,-,  =  L-,  u  CL2  we  have 

{c,cc}L   =  {ca^,cca^,  cca-^b^,  ccca^b^:  i  >  l)   is  not  LL(k) 
We  have  just  shown  the  following. 

Lemma  50 . 

^   is  not  closed  under  product,  pre-product  with  a 
regular  set  or  post-product  with  a  regular  set. 
We  have  in  fact  shown  a  stronger  result  for  RL,  namely: 


-31- 


^  •     •  ^    1  under  pre-product   "'  a  finit~  -" . 

However  we    :  ive  (the  proof  left  to  the  reader) 

Lemma  52. 

^  is  closed  under  post-product  with  a  finite  set. 

Let   L,  2  =  L,  v^  cLp  u  {c}   we  have  that 

« 
L,p  contains  strings  of  the  form 

ca'^x  and  ca'^b^y  which  means  that  L,p  is  not   LL(k); 

this  Biives  : 

Lemma  53- 

J_   is  not  closed  under  catenation  closure. 

L,  A{c,£}  =  L- ,  U  Lp   and  is  therefore  not  LL(k)  giving 
an  expected  result: 

Lemma  5^. 

^  is  not  closed  under  left  quotient,  left  quotient  with 
a  regular  set  or  left  quotient  with  a  finite  set. 

Tixier  (196?)  has  shown  the  strongest  possible  result: 

Lemma  55. 

<^  is  closed  under  left  quotient  with  a  single  word. 

Letting  L^^  =  L^/  ((ab'^  :  i  >  1}    {e}  )  this  gives 
{a^b^ja^"*'-'-  :  i  >  1}  which  is  not  LL(k)  for  any  k. 
Therefore  we  have 

Lemma  56. 

^   is  not  closed  under  right  quotient  or  right  quotient  with 
a  regular  set. 


However,  as  expected  by  the  postproduct  result: 

Lemma  57. 

^  is  closed  under  right  quotient  with  a  finite  set. 

We  now  examine  those  operations  introduced  by  Glnsburg 

and  Greibach  (1965)  which  preserve  the  deterministic  languages. 

Definition. 

Init(L)  =  {u  :  uv  e  L  for  some  v  e  T  1,  thus 

Init(L)  =  L/T  ,  the  set  of  all  initial  subwords  of  words  m  L, 

Taking  L  =  {a^b^:  1  >  l),   Init(L)  =  (a^b'^' :  i  1  j  >  0} 

which  is  not  LL(k)  for  any  k,  giving 

Lemma  58 . 

Inlt  does  not  preserve  the  LL(k)  condition. 

Definition. 

RDlVCL-^jL^)  =  {u  :  uL^  C  L-^}. 

Lemma  59 . 


RDIV  does  not  preserve 
Proof: 


^, 


Let   L^  =  {a   b  (ab^c|c)  :  i,j  >  1}   and 


L2  =  {a  b^c  :  i  >  1} 
then  RDIV(L   L^)  =  {a^  ,a"^''"-''b^  :  i  >  1}  which  is  not  in  ^. 

Miscellaneous  Operations. 
Lemma  60. 


^tL   is  not  closed  under  reversal. 


Proof:   Let  L  =  (L  U  L^)''^  which  is  LL(1)   but  l'^  is  not  LL(k) 
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